An Efficient Approach for Clustering Web Access Patterns from Web Logs

نویسنده

  • Peilin Shi
چکیده

The interests of web users can be revealed by their visited web pages and time duration on these web pages during their surfing. Time duration on a web page is characterized as a fuzzy linguistic variable because linguistic variable makes users easily understand the expression of time duration and can disregard subtle difference between two time durations. Each web access pattern from web logs is transformed as corresponding fuzzy web access pattern, which is a fuzzy vector composed of fuzzy linguistic variables or 0. Each element in fuzzy web access patterns represents visited web page and time duration on this web page. This paper proposed a rough k-means clustering algorithm based on properties of rough variable to group the gained fuzzy web access patterns. Finally, an example and experiment is provided to illustrate the clustering process. Using this approach, users can effectively mine web logs records to discover interesting user access patterns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تشخیص ناهنجاری روی وب از طریق ایجاد پروفایل کاربرد دسترسی

Due to increasing in cyber-attacks, the need for web servers attack detection technique has drawn attentions today. Unfortunately, many available security solutions are inefficient in identifying web-based attacks. The main aim of this study is to detect abnormal web navigations based on web usage profiles. In this paper, comparing scrolling behavior of a normal user with an attacker, and simu...

متن کامل

Clustering Web Access Patterns Based on learning Automata

The interest of web users can be revealed by the visited web pages and time duration on these web pages during their surfing. In this paper a new method based on Learning Automata for clustering web access patterns is proposed. At the first step of the proposed algorithm, each web access pattern from web logs is transformed into a weight vector using the learning automata. In the second step a ...

متن کامل

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...

متن کامل

A density based clustering approach to distinguish between web robot and human requests to a web server

Today world's dependence on the Internet and the emerging of Web 2.0 applications is significantly increasing the requirement of web robots crawling the sites to support services and technologies. Regardless of the advantages of robots, they may occupy the bandwidth and reduce the performance of web servers. Despite a variety of researches, there is no accurate method for classifying huge data ...

متن کامل

Expected Value of User Sessions: Limitations to the Non-Semantic Approach

Mining web access logs using a fuzzy realtional clustering algrotihm based on a robust estimator. [15] Yongjian Fu. Clustering of web users based on access patterns. INSITE: A tool for real-time knowledge discovery from users web navigation.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009